Abstract

The contribution of both inclusive and exclusive cross-section data from HERA-I
to our knowledge of parton distribution functions is reviewed and future prospects
are outlined.

1 Introduction 

The kinematics of lepton hadron scattering is described in terms of the variables $Q^2$, the negative of the
invariant mass squared of the exchanged vector boson, Bjorken $x$, the fraction of the momentum of the
incoming nucleon taken by the struck quark (in the quark-parton model), and $y$, which measures the energy
transfer between the lepton and hadron systems. The differential cross-section for the neutral current (NC)
process is given in terms of the structure functions by

$$\frac{d^2\sigma(e^\pm p)}{dx\,dQ^2} = \frac{2\pi\alpha^2}{Q^4 x}\left[Y_+ F_2(x,Q^2) - y^2 F_L(x,Q^2) \mp Y_- xF_3(x,Q^2)\right],$$

where $Y_\pm = 1 \pm (1-y)^2$. The structure functions $F_2$ and $xF_3$ are directly related to quark distributions,
and their $Q^2$ dependence, or scaling violation, is predicted by pQCD. For low $x$, $x \le 10^{-2}$, $F_2$ is
sea quark dominated, but its $Q^2$ evolution is controlled by the gluon contribution, such that HERA
data provide crucial information on low-$x$ sea-quark and gluon distributions. At high $Q^2$, the structure
function $xF_3$ becomes increasingly important, and gives information on valence quark distributions. The
charged current (CC) interactions also enable us to separate the flavour of the valence distributions at 
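high-$x$, since their (LO) cross-sections are given by

$$\frac{d^2\sigma(e^+p)}{dx\,dQ^2} = \frac{G_F^2 M_W^4}{2\pi x\,(Q^2+M_W^2)^2}\, x\left[(\bar u + \bar c) + (1-y)^2 (d+s)\right],$$

$$\frac{d^2\sigma(e^-p)}{dx\,dQ^2} = \frac{G_F^2 M_W^4}{2\pi x\,(Q^2+M_W^2)^2}\, x\left[(u + c) + (1-y)^2 (\bar d+\bar s)\right].$$
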
Parton Density Function (PDF) determinations are usually global fits [1-3], which use fixed target
DIS data as well as HERA data. In such analyses, the high statistics HERA NC $e^+p$ data have determined
the low-$x$ sea and gluon distributions, whereas the fixed target data have determined the valence
distributions. Now that high-$Q^2$ HERA data on NC and CC $e^+p$ and $e^-p$ inclusive double differential
cross-sections are available, PDF fits can be made to HERA data alone, since the HERA high-$Q^2$
cross-section data can be used to determine the valence distributions. This has the advantage that it eliminates
the need for heavy target corrections, which must be applied to the $\nu$-Fe and $\mu D$ fixed target data.
Furthermore there is no need to assume isospin symmetry, i.e. that $d$ in the proton is the same as $u$ in the
neutron, since the $d$ distribution can be obtained directly from CC $e^+p$ data.

The H1 and ZEUS collaborations have both used their data to make PDF fits [4]. In Section 2 we
review the published PDF analyses, paying particular attention to the treatment of correlated systematic
errors. In Section 3 we present the preliminary results of a combination of ZEUS and H1 data. In
Section 4 we discuss the improvement in our knowledge of the gluon PDF, which comes from the addition
of jet data to the PDF fits, and we present the measurements of $\alpha_s$ which have been made using HERA
jet data. In Section 5 we present preliminary fits using HERA-II data and in Section 6 we conclude by
looking at the prospects for the future.



2 Comparing ZEUS and H1 published PDF analyses

Full details of the analyses are given in the relevant publications; in this contribution we examine the
differences between the two analyses, recapping only salient details. For both HERA analyses the QCD
predictions for the structure functions are obtained by solving the DGLAP evolution equations [5] at NLO
in the $\overline{\rm MS}$ scheme with the renormalisation and factorisation scales chosen to be $Q^2$. These equations
yield the PDFs at all values of $Q^2$ provided they are input as functions of $x$ at some input scale $Q_0^2$.
The resulting PDFs are then convoluted with coefficient functions, to give the structure functions which
enter into the expressions for the cross-sections. For the ZEUS analysis, the coefficient functions are
calculated using the general-mass variable flavour number scheme of Roberts and Thorne [6]. For the
H1 analysis, the zero-mass variable flavour number scheme is used.

The HERA data are all in a kinematic region where there is no sensitivity to target mass and
higher twist contributions, but a minimum $Q^2$ cut must be imposed to remain in the kinematic region
where perturbative QCD should be applicable. For ZEUS this is $Q^2 > 2.5$ GeV$^2$, and for H1 it is
$Q^2 > 3.5$ GeV$^2$. Both collaborations have included the sensitivity to this cut as part of their model
uncertainties.

In the ZEUS analysis (called the ZEUS-JETS fit), the PDFs for $u$ valence, $xu_v(x)$, $d$ valence,
$xd_v(x)$, total sea, $xS(x)$, the gluon, $xg(x)$, and the difference between the $d$ and $u$ contributions to the
sea, $x(\bar d - \bar u)$, were each parametrized by the form

$$xf(x) = p_1 x^{p_2} (1-x)^{p_3} P(x), \qquad (1)$$

where $P(x) = 1 + p_4 x$, at $Q_0^2 = 7$ GeV$^2$. The total sea is $xS = 2x(\bar u + \bar d + \bar s + \bar c + \bar b)$, where
$\bar q = q_{sea}$ for each flavour, $u = u_v + u_{sea}$, $d = d_v + d_{sea}$ and $q = q_{sea}$ for all other flavours. The flavour
structure of the light quark sea allows for the violation of the Gottfried sum rule. However, there is no
information on the shape of the $\bar d - \bar u$ distribution in a fit to HERA data alone and so this distribution has
its shape fixed consistent with the Drell-Yan data and its normalisation consistent with the size of the
Gottfried sum-rule violation. A suppression of the strange sea with respect to the non-strange sea by a factor
of 2 at $Q_0^2$ is also imposed, consistent with neutrino induced dimuon data from CCFR. Parameters are further
restricted as follows. The normalisation parameters, $p_1$, for the $d$ and $u$ valence and for the gluon are
constrained to impose the number sum-rules and momentum sum-rule. The $p_2$ parameter, which constrains the
low-$x$ behaviour of the $u$ and $d$ valence distributions, is set equal for the two, since there is no information
to constrain any difference. In the present fits to HERA-I data it is also necessary to constrain the high-$x$
sea and gluon shapes, because HERA-I data do not have high statistics at large $x$, in the region where these
distributions are small. The sea shape has been restricted by setting $p_4 = 0$ for the sea, but the gluon shape
is constrained by including data on jet production in the PDF fit, as discussed in Sec. 4. Finally the ZEUS
analysis has 11 free PDF parameters. ZEUS have included reasonable variations of these assumptions about the
input parametrization in their analysis of model uncertainties. The strong coupling constant was fixed to
$\alpha_s(M_Z^2) = 0.118$ [7]. Full account has been taken of correlated experimental systematic errors by the
Offset Method, as described in refs. [3, 8].
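
As an illustration of how a number sum-rule fixes a normalisation parameter, the following Python sketch computes $p_1$ for a valence distribution of the form of Eq. (1), assuming $xq_v(x) = p_1 x^{p_2}(1-x)^{p_3}(1+p_4 x)$ and $\int_0^1 q_v\,dx = N$. The integral is then a sum of two Euler Beta functions. The function names are hypothetical and the shape parameters are purely illustrative; they are not taken from the ZEUS fit.

```python
import math


def beta(a, b):
    """Euler Beta function B(a, b) expressed through the Gamma function."""
    return math.gamma(a) * math.gamma(b) / math.gamma(a + b)


def valence_normalisation(n_valence, p2, p3, p4):
    """Return p1 such that the number sum-rule int_0^1 q_v(x) dx = n_valence holds.

    Assumes x*q_v(x) = p1 * x**p2 * (1 - x)**p3 * (1 + p4*x), i.e.
    q_v(x) = p1 * x**(p2 - 1) * (1 - x)**p3 * (1 + p4*x), integrable for p2 > 0.
    """
    integral = beta(p2, p3 + 1.0) + p4 * beta(p2 + 1.0, p3 + 1.0)
    return n_valence / integral


def xq_valence(x, p1, p2, p3, p4):
    """Evaluate x*q_v(x) for the parametrization of Eq. (1)."""
    return p1 * x**p2 * (1.0 - x)**p3 * (1.0 + p4 * x)


# Illustrative (not fitted) shape parameters for a u-valence-like distribution.
p2, p3, p4 = 0.6, 4.0, 3.0
p1_u = valence_normalisation(2.0, p2, p3, p4)  # two valence u quarks in the proton
print("p1 fixed by the number sum-rule:", p1_u)
```

In a real fit this step removes $p_1$ from the list of free parameters: each time the minimiser proposes new shape parameters, the normalisation is recomputed rather than fitted.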

For the H1 analysis (called the H1 2000 PDF fit), the input scale is $Q_0^2 = 4$ GeV$^2$, and the choice of
quark distributions which are parametrized is different. The quarks are considered as $u$-type and $d$-type
with different parametrizations for $xU = x(u_v + u_{sea} + c)$, $xD = x(d_v + d_{sea} + s)$, $x\bar U = x(\bar u + \bar c)$ and
$x\bar D = x(\bar d + \bar s)$, with $q_{sea} = \bar q$, as usual, and the form of the quark and gluon parametrizations given
by Eq. (1). For $x\bar D$ and $x\bar U$ the polynomial $P(x) = 1.0$, for the gluon and $xD$, $P(x) = (1 + p_4 x)$, and for
$xU$, $P(x) = (1 + p_4 x + p_5 x^3)$. The parametrization is then further restricted as follows. Since the valence
distributions must vanish as $x \to 0$, the low-$x$ parameters, $p_1$ and $p_2$, are set equal for $xU$ and $x\bar U$, and for
$xD$ and $x\bar D$. Since there is no information on the flavour structure of the sea it is also necessary to set $p_2$
equal for $x\bar U$ and $x\bar D$. The normalisation, $p_1$, of the gluon is determined from the momentum sum-rule
and the $p_4$ parameters for $xU$ and $xD$ are determined from the valence number sum-rules. Assuming
that the strange and charm quark distributions can be expressed as $x$ independent fractions, $f_s$ and $f_c$, of
the $d$ and $u$ type sea, gives the further constraint $p_1(\bar U) = p_1(\bar D)(1 - f_s)/(1 - f_c)$. Finally there are 10
free parameters. H1 have also included reasonable variations of these assumptions in their analysis of
model uncertainties. The strong coupling constant was fixed to $\alpha_s(M_Z^2) = 0.1185$ and this is sufficiently
similar to the ZEUS choice that we can rule it out as a cause of any significant difference. Full account
has been taken of correlated experimental systematic errors by the Hessian Method, see ref. [8].

The different treatments of correlated experimental systematic errors deserve a little more
discussion since modern deep inelastic scattering experiments have very small statistical uncertainties, so
that the contribution of systematic uncertainties becomes dominant and consideration of point to point
correlations between systematic uncertainties is essential.

For both ZEUS and H1 analyses the formulation of the $\chi^2$ including correlated systematic uncertainties
is constructed as follows. The correlated uncertainties are included in the theoretical prediction,
$F_i(p,s)$, such that

$$F_i(p,s) = F_i^{\rm NLOQCD}(p) + \sum_\lambda s_\lambda \Delta^{\rm sys}_{i\lambda}, \qquad (2)$$

where $F_i^{\rm NLOQCD}(p)$ represents the prediction from NLO QCD in terms of the theoretical parameters $p$,
and the parameters $s_\lambda$ represent independent variables for each source of systematic uncertainty. They
have zero mean and unit variance by construction. The symbol $\Delta^{\rm sys}_{i\lambda}$ represents the one standard deviation
correlated systematic error on data point $i$ due to correlated error source $\lambda$. The $\chi^2$ is then formulated as

$$\chi^2 = \sum_i \frac{[F_i(p,s) - F_i({\rm meas})]^2}{\sigma_i^2} + \sum_\lambda s_\lambda^2,$$

where $F_i({\rm meas})$ represents a measured data point and the symbol $\sigma_i$ represents the one standard deviation
uncorrelated error on data point $i$, from both statistical and systematic sources. The experiments use
this $\chi^2$ in different ways. ZEUS uses the Offset method and H1 uses the Hessian method.

Traditionally, experimentalists have used 'Offset' methods to account for correlated systematic
errors. The $\chi^2$ is formulated without any terms due to correlated systematic errors ($s_\lambda = 0$ in Eq. (2)) for
evaluation of the central values of the fit parameters. However, the data points are then offset to account
for each source of systematic error in turn (i.e. set $s_\lambda = +1$ and then $s_\lambda = -1$ for each source $\lambda$) and a
new fit is performed for each of these variations. The resulting deviations of the theoretical parameters
from their central values are added in quadrature. (Positive and negative deviations are added in quadrature
separately.) This procedure gives fitted theoretical predictions which are as close as possible to the
central values of the published data. It does not use the full statistical power of the fit to improve the
estimates of $s_\lambda$, and thus it is a more conservative method of error estimation than the Hessian method.

The Hessian method is an alternative procedure in which the systematic uncertainty parameters $s_\lambda$
are allowed to vary in the main fit when determining the values of the theoretical parameters. Effectively,
the theoretical prediction is not fitted to the central values of the published experimental data, but these
data points are allowed to move collectively, according to their correlated systematic uncertainties. The
theoretical prediction determines the optimal settings for correlated systematic shifts of experimental data
points such that the most consistent fit to all data sets is obtained. Thus, in a global fit, systematic shifts
in one experiment are correlated to those in another experiment by the fit. In essence one is allowing
the theory to calibrate the detectors. This requires confidence in the theory, but more significantly, it
requires confidence in the many model choices (such as the parametrization at $Q_0^2$) which go into setting
the boundary conditions for the theory.
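
The difference between the two treatments can be made concrete with a toy example. The Python sketch below is not the ZEUS or H1 fitting code: it assumes a hypothetical one-parameter linear model $F_i = p\,x_i$, a single correlated source with shifts `delta`, and invented data values. It fits the same points once with the Offset prescription and once by minimising the $\chi^2$ over both $p$ and $s$.

```python
import math


def fit_offset(x, y, sigma, delta):
    """Offset method for the toy model F_i = p * x_i.

    The central fit ignores the correlated source (s = 0). The data are then
    shifted by +delta and by -delta and refitted; the resulting shift in p is
    added in quadrature to the uncorrelated fit error.
    """
    w = [1.0 / s**2 for s in sigma]
    sxx = sum(wi * xi * xi for wi, xi in zip(w, x))

    def best_p(data):
        return sum(wi * xi * di for wi, xi, di in zip(w, x, data)) / sxx

    p0 = best_p(y)
    p_up = best_p([yi + di for yi, di in zip(y, delta)])
    p_dn = best_p([yi - di for yi, di in zip(y, delta)])
    # For this linear toy model the up and down deviations are equal in size.
    shift = max(abs(p_up - p0), abs(p_dn - p0))
    return p0, math.sqrt(1.0 / sxx + shift**2)


def fit_hessian(x, y, sigma, delta):
    """Hessian method: minimise sum((p*x + s*delta - y)^2 / sigma^2) + s^2 over p and s."""
    w = [1.0 / s**2 for s in sigma]
    a = sum(wi * xi * xi for wi, xi in zip(w, x))
    b = sum(wi * xi * di for wi, xi, di in zip(w, x, delta))
    c = sum(wi * di * di for wi, di in zip(w, delta)) + 1.0  # +1 from the s^2 penalty
    u = sum(wi * xi * yi for wi, xi, yi in zip(w, x, y))
    v = sum(wi * di * yi for wi, di, yi in zip(w, delta, y))
    det = a * c - b * b
    p = (c * u - b * v) / det        # Cramer's rule for the 2x2 normal equations
    s = (a * v - b * u) / det
    return p, s, math.sqrt(c / det)  # error on p from the inverse of the 2x2 matrix


# Invented toy data, for illustration only.
x = [1.0, 2.0, 3.0, 4.0]
y = [1.1, 2.3, 3.2, 4.6]
sigma = [0.1, 0.1, 0.1, 0.1]
delta = [0.20, 0.05, 0.10, 0.02]
p_off, err_off = fit_offset(x, y, sigma, delta)
p_hes, s_hes, err_hes = fit_hessian(x, y, sigma, delta)
print("Offset :", p_off, "+/-", err_off)
print("Hessian:", p_hes, "+/-", err_hes, " fitted shift s =", s_hes)
```

For this linear toy model the Offset uncertainty is never smaller than the Hessian one, and the two coincide when the correlated shifts are exactly proportional to the prediction (a pure normalisation uncertainty), since the fit then has no power to constrain $s$.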

To compare these two methods the ZEUS analysis has been performed using the Hessian method
as well as the Offset method, and Fig. 1 compares the PDFs, and their uncertainties, using these two
methods. The central values of the different methods are in good agreement but the use of the Hessian
method results in smaller uncertainties, for the standard set of model assumptions. However, model
uncertainties are more significant for the Hessian method than for the Offset method. The PDF parameters
obtained for different model choices can differ by much more than their experimental uncertainties,
because each model choice can result in somewhat different values of the systematic uncertainty parameters,
$s_\lambda$, and thus a different estimate of the shifted positions of the data points. This results in a larger
spread of model uncertainty than is found in the Offset method, for which the data points cannot move.
Thus when the total uncertainty from both experimental and model sources is computed there is no great
difference between these two approaches.

Fig. 1: PDFs at $Q^2 = 10$ GeV$^2$, for the ZEUS analysis comparing the Offset and the Hessian methods.

Fig. 2: Left plot: Comparison of PDFs from ZEUS and H1 analyses at $Q^2 = 10$ GeV$^2$. Right plot: Comparison of gluon from
ZEUS and H1 analyses, at various $Q^2$. Note that the ZEUS analysis total uncertainty includes both experimental and model
uncertainties.

Fig. 2 compares the results of the H1 and ZEUS analyses and illustrates the comparability of the
ZEUS (Offset) total uncertainty estimate to the H1 (Hessian) experimental plus model uncertainty estimate.
Whereas the extracted PDFs are broadly compatible within errors, there is a noticeable difference
in the shape of the gluon PDFs. This can be traced to small but significant differences in the slope
of the low-$Q^2$ data. Thus there could be an advantage in combining ZEUS and H1 data into a single data
set [9], not just in terms of reducing statistical errors, but also in reducing systematic errors by using each
experiment to calibrate the other.

3 Combining ZEUS and H1 HERA-I data

Essentially, since ZEUS and H1 are measuring the same physics in the same kinematic region, one can
try to combine them using a 'theory-free' Hessian fit in which the only assumption is that there is a true
value of the cross-section, for each process, at each $x, Q^2$ point. The systematic uncertainty parameters,
$s_\lambda$, of each experiment are fitted to determine the best fit to this assumption. Thus each experiment is
calibrated to the other. This works well because the sources of systematic uncertainty in each experiment
are rather different. Once the procedure has been performed the resulting systematic uncertainties on
each of the combined data points are significantly smaller than the statistical errors. Fig. 3 shows the
NC $e^+p$ reduced cross-sections from the HERA combination and compares the individual H1 and ZEUS
results with those of the combination so that the scale of the improvement can be appreciated.

[Figure: HERA I $e^+p$ Neutral Current Scattering - H1 and ZEUS]

Fig. 3: Left hand side: HERA-I combined data on $\sigma_r$ as a function of $Q^2$ for NC $e^+p$ scattering, together with fixed target data,
for $x$ bins across the whole measured kinematic plane. Right hand side: H1, ZEUS and HERA-I combined data on $\sigma_r$ for NC
$e^+p$ scattering for low, middling and high $x$.

4 Adding exclusive jet cross-section data to PDF fits, and measurements of $\alpha_s(M_Z)$

The gluon PDF contributes only indirectly to the inclusive DIS cross sections, through the scaling violations.
However it makes a direct contribution to jet cross sections through boson-gluon and quark-gluon
scattering, so that measurements of these cross sections can constrain the gluon density. Furthermore,
the addition of the jet production data allows an accurate determination of $\alpha_s(M_Z)$ to be made in a
simultaneous fit for $\alpha_s(M_Z)$ and the PDF parameters.

In the ZEUS-JETS PDF fit, ZEUS neutral current $e^+p$ DIS inclusive jet cross sections and direct
photoproduction dijet cross sections have been used to constrain the gluon. The predictions for the
jet cross sections were calculated to NLO in QCD using the programme of Frixione and Ridolfi [10]
for photoproduced dijets and DISENT [11] for jet production in DIS. These calculations are too slow
to be used iteratively in the fit. Thus, they were used to compute LO and NLO weights, $\tilde\sigma$, which
are independent of $\alpha_s$ and the PDFs, and are obtained by integrating the corresponding partonic hard
cross section$^1$ in bins of $\xi$ (the proton momentum fraction carried by the incoming parton), $\mu_F$ (the
factorisation scale) and $\mu_R$ (the renormalisation scale). The predictions for the NLO QCD cross sections
are then obtained by folding these weights with the PDFs and $\alpha_s$ according to the formula

$$\frac{d\sigma}{dA} = \sum_n \sum_a \sum_{i,j,k} f_a(\langle\xi\rangle_i, \langle\mu_F\rangle_j) \cdot \alpha_s^n(\langle\mu_R\rangle_k) \cdot \tilde\sigma^{(n)}_{a,\{i,j,k\}}, \qquad (3)$$

where $A$ is the measured observable and the three sums run over the order $n$ in $\alpha_s$, the flavour $a$ of the
incoming parton, and the indices ($i$, $j$, $k$) of the $\xi$, $\mu_F$ and $\mu_R$ bins, respectively. This procedure
reproduces the NLO predictions to better than 0.5%.
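
The folding of Eq. (3) is simple enough to sketch in a few lines of Python. The layout of the weight table, the function names and the one-loop running coupling used as a stand-in are all assumptions made for illustration; the actual fit uses its own grids and NLO evolution.

```python
import math

M_Z = 91.19  # GeV, Z boson mass (rounded)


def alpha_s_one_loop(mu, alpha_s_mz, n_f=5):
    """One-loop running coupling at scale mu (GeV); a simple stand-in only."""
    b0 = (33.0 - 2.0 * n_f) / (12.0 * math.pi)
    return alpha_s_mz / (1.0 + b0 * alpha_s_mz * math.log(mu**2 / M_Z**2))


def fold_weights(weights, pdf, alpha_s, xi_bins, muf_bins, mur_bins):
    """Evaluate sum_n sum_a sum_{i,j,k} f_a(xi_i, muF_j) * alpha_s(muR_k)**n * w[n][a][i][j][k].

    weights : dict mapping the order n to a dict mapping the flavour a to a
              nested list indexed [i][j][k] (hypothetical layout)
    pdf     : callable pdf(a, xi, mu_f) returning the parton density f_a
    alpha_s : callable alpha_s(mu_r)
    """
    total = 0.0
    for n, by_flavour in weights.items():
        for a, grid in by_flavour.items():
            for i, xi in enumerate(xi_bins):
                for j, mu_f in enumerate(muf_bins):
                    for k, mu_r in enumerate(mur_bins):
                        total += pdf(a, xi, mu_f) * alpha_s(mu_r) ** n * grid[i][j][k]
    return total
```

Because the weights depend on neither the PDFs nor $\alpha_s$, they are computed once; each iteration of the fit then only re-evaluates `pdf` and `alpha_s` on the bin centres, which is what makes the jet predictions fast enough to use inside the minimisation.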

$^1$ For the dijet photoproduction cross sections, the weights also included the convolution with the photon PDFs.

Fig. 4: Left hand side: the sea and gluon PDFs from the ZEUS-JETS fit for various $Q^2$ values. Right hand side: the fractional
error on the gluon PDF for various $Q^2$ values, for PDF fits made with and without including the jet data.

Fig. 5: The $\chi^2$ profiles vs. $\alpha_s(M_Z)$ for ZEUS PDF fits made with and without including the jet data.



The cross-section predictions for photoproduced jets are sensitive to the choice of the input photon
PDFs. The AFG photon PDF [12] was used in the fits, but in order to minimise sensitivity to this choice,
the analysis was restricted to use only the 'direct' photoproduction cross sections. These are defined by
the cut $x_\gamma^{\rm obs} > 0.75$, where $x_\gamma^{\rm obs}$ is a measure of the fraction of the photon's momentum that enters into
the hard scatter.

Fig. 4 shows that the jet data constrain the gluon mainly in the range $0.01 \lesssim \xi \lesssim 0.4$, although
the momentum sum-rule ensures that the indirect constraint of these data is still significant at higher $x$.
The decrease in the uncertainty on the gluon distribution is striking; for example at $Q^2 = 7$ GeV$^2$ and
$x = 0.06$ the uncertainty is reduced from 17% to 10%. A similar decrease in uncertainty by a factor of
about two is found in this mid-$x$ range, over the full $Q^2$ range.

The value of $\alpha_s(M_Z)$ is fixed in most PDF fits but a simultaneous fit for $\alpha_s(M_Z)$ and the PDF
parameters can be made. Such fits to inclusive cross-section data do not yield accurate values of $\alpha_s(M_Z)$
because of the strong correlation between $\alpha_s(M_Z)$ and the gluon shape which comes from the DGLAP
equations. However including jet data in the fit provides additional constraints. In the ZEUS-JETS fit
with free $\alpha_s(M_Z)$ the value

$$\alpha_s(M_Z) = 0.1183 \pm 0.0027\,({\rm exp.})$$

is obtained. Figure 5 illustrates the improved accuracy of the extraction of $\alpha_s(M_Z)$ due to the inclusion
of the jet data. The $\chi^2$ profile around the minimum is shown as a function of $\alpha_s(M_Z)$ for the ZEUS-JETS
fit with $\alpha_s$ free, and a similar fit in which the jet data are not included.
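
A $\chi^2$ profile of this kind is commonly summarised by a central value and a $\Delta\chi^2 = 1$ uncertainty. As a minimal sketch, assuming the profile is parabolic near its minimum, the Python function below passes a parabola exactly through three scanned points and reads off both quantities. The function name is hypothetical and the scan values in the usage lines are invented; they are not read from Fig. 5.

```python
import math


def parabola_minimum(points):
    """Fit chi2 = c0 + c1*a + c2*a**2 exactly through three (a, chi2) points.

    Returns (a_min, chi2_min, sigma), where sigma is the half-width at
    which chi2 rises by one unit above its minimum (Delta chi2 = 1).
    """
    (x1, y1), (x2, y2), (x3, y3) = points
    # Divided differences give the Newton form of the interpolating parabola.
    d12 = (y2 - y1) / (x2 - x1)
    d23 = (y3 - y2) / (x3 - x2)
    c2 = (d23 - d12) / (x3 - x1)
    if c2 <= 0.0:
        raise ValueError("profile is not convex; no minimum")
    c1 = d12 - c2 * (x1 + x2)
    c0 = y1 - c1 * x1 - c2 * x1**2
    a_min = -c1 / (2.0 * c2)
    chi2_min = c0 + c1 * a_min + c2 * a_min**2
    return a_min, chi2_min, 1.0 / math.sqrt(c2)


# Invented scan points (alpha_s value, chi2), for illustration only.
scan = [(0.114, 471.2), (0.118, 470.0), (0.122, 472.4)]
a_min, chi2_min, sigma = parabola_minimum(scan)
print("minimum at", a_min, "with Delta chi2 = 1 half-width", sigma)
```

A narrower parabola corresponds directly to a smaller uncertainty, which is how the comparison of fits with and without jet data is read from such a plot.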

There have also been accurate determinations of $\alpha_s(M_Z)$ using HERA jet data independent of
PDF fits and a combined ZEUS and H1 $\alpha_s(M_Z)$ extraction has been made [13]. Fig. 6 compares this
combined value to those of the individual experiments and to the world average, and also illustrates the
running of $\alpha_s$, as determined from the HERA experiments.

Fig. 6: Left hand side: $\alpha_s$ as a function of $Q$ determined from HERA data. Right hand side: $\alpha_s(M_Z)$ values from H1, ZEUS

Fig. 7: ZEUS NC $e^-p$ data from HERA-II running with polarised beams. The predictions of the ZEUS-pol fit are superimposed.

5 Adding HERA-II data to PDF fits

The determinations of the valence PDFs from HERA-I data are not as accurate as those from global fits,
but this is rapidly improving with the addition of HERA-II data. Fig. 7 shows ZEUS $e^-p$ NC data from the
2004-6 running period with polarised beams. There are 105 pb$^{-1}$ of negatively polarised, $P_e = -0.27$,
data and 71.8 pb$^{-1}$ of positively polarised, $P_e = +0.30$, data. These data, and $e^-p$ CC data from 2004-5,
have been input to the ZEUS-JETS fit analysis framework and this new fit is called the ZEUS-pol
fit [14]. The polarization of the data has been exploited to measure the neutral current vector and axial
vector couplings [14]. The results of this ZEUS-pol fit are superimposed on the data in Fig. 7. The PDFs
extracted from the ZEUS-pol fit are compared to those of the ZEUS-JETS fit in Fig. 8. The central values
of the fit are fully compatible with the ZEUS-JETS fit, and the $u$-valence quark uncertainty is reduced
significantly at large $x$. The improvement is mostly in the $u$-valence quark at present because the $e^-p$
data are $u$ quark dominated at large $x$. We can expect improvements in the $d$-valence distribution when
the final $e^+p$ CC HERA-II data become available.



6 The future 



We conclude with a look to the future. HERA data will continue to improve our knowledge of PDFs for
the next few years. Firstly, there are more jet data both from HERA-I [15-17] and from $\sim 500$ pb$^{-1}$ of
HERA-II [18, 19] analyses, as shown in Fig. 9. Inputting these data should improve determinations of the
high-$x$ gluon.

It is also interesting to investigate the low-$x$ gluon, where the theoretical formalism of the NLO
DGLAP equations may need extending to account for $\ln(1/x)$ resummation [20-22] or even non-linear
terms [23]. Fig. 4 shows the gluon and the sea PDFs predicted by the ZEUS-JETS fit. For $Q^2 > 7$ GeV$^2$
the gluon PDF is larger than and steeper than the sea PDF, but for lower $Q^2$ it flattens and even becomes
valence-like. This counter-intuitive behaviour may come from the use of the DGLAP equations outside
their region of applicability. At low $x$ the form of the DGLAP equations is such that one has $F_2 \sim xq$
and $dF_2/d\ln Q^2 \sim P_{qg}\,xg$. The determination of the gluon distribution comes from the measurement of the
scaling violations, $dF_2/d\ln Q^2$, but these may be determined by either the gluon density or the splitting
function. Thus the odd behaviour of the gluon may in fact derive from use of an incorrect splitting
function. The use of a calculation of the low-$x$ splitting functions which includes $\ln(1/x)$ resummation
results in a steeper gluon PDF [20]. To settle these ambiguities definitively we need a measurement
of the gluon density at small $x$ which does not derive from the scaling violations of $F_2$, for example a
measurement of $F_L$ or $F_2^{c\bar c}$, $F_2^{b\bar b}$.

So far the addition of charm data to PDF fits has made little impact [24] but there are new data
on $F_2^{c\bar c}$ from ZEUS, using $D$ production from 82 pb$^{-1}$ of HERA-I running [25], and using $D^*$ (and $D$)
production from 162 pb$^{-1}$ [26] (and 135 pb$^{-1}$ [27]) of HERA-II running. There are also H1 data on $F_2^{c\bar c}$
from 54 pb$^{-1}$ [28] of HERA-II data and these have been averaged together with the HERA-I data. Both
collaborations have also extracted $F_2^{b\bar b}$, H1 using the same data sample as for their charm extraction and
ZEUS using 39 pb$^{-1}$ of HERA-II data [29]. These data are shown in Fig. 10. In principle heavy quark
data should give information on the gluon distribution since heavy quarks are generated by $g \to c\bar c$
and $g \to b\bar b$. However, at the present time there is some theoretical disagreement about heavy quark
production schemes [30-32] such that these data may tell us more about the correct treatment of heavy
quarks than about PDFs.

The structure function $F_L$ depends strongly on the gluon [33]. A model independent measurement
of $F_L$ requires data at different beam energies, so in 2007 HERA was run at proton beam energies of 460
GeV and 575 GeV. $F_L$ only makes strong contributions to the cross-section at high $y$, and measurements
at high $y$ require technically challenging identification of low energy scattered electrons. Both
collaborations have been preparing for this challenge by extending their measurement capabilities to high $y$
using the nominal energy HERA-I and HERA-II running. Fig. 11 shows data at high $y$ from ZEUS
HERA-II 2006 running [34], and from H1 HERA-I running, at low $Q^2$ [35], and HERA-II running, at
high $Q^2$ [36]. These data not only pave the way for measurement of $F_L$, they are also interesting in their
own right since they access a new kinematic regime. Thus we look forward to exciting new information
on hadron structure from these measurements in the near future.

Fig. 9: Upper plots: ZEUS jet data from (left) inclusive DIS (HERA-I, 82 pb$^{-1}$), (middle) di-jet photoproduction
(HERA-I, 82 pb$^{-1}$), (right) di-jet DIS (HERA-II, 209 pb$^{-1}$). Lower plots: H1 normalised inclusive jet cross-sections
(HERA-I and HERA-II, 320 pb$^{-1}$).

Fig. 10: Heavy quark production data compared to predictions from different PDF sets using different heavy quark production
schemes. Left: $F_2^{c\bar c}$ data from HERA. Right: $F_2^{b\bar b}$ data from HERA.

Fig. 11: HERA data at high $y$ from: (left) ZEUS HERA-II, (middle) H1 HERA-I at low $Q^2$ and (right) H1 HERA-II at high $Q^2$.